Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 34857 |
| Missing cells | 100975 |
| Missing cells (%) | 13.8% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 5.6 MiB |
| Average record size in memory | 168.0 B |
Variable types
| Categorical | 8 |
|---|---|
| Numeric | 13 |
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
SUBURB has a high cardinality: 351 distinct values | High cardinality |
ADDRESS has a high cardinality: 34009 distinct values | High cardinality |
SELLERG has a high cardinality: 388 distinct values | High cardinality |
DATE has a high cardinality: 78 distinct values | High cardinality |
ROOMS is highly correlated with BEDROOM2 | High correlation |
BEDROOM2 is highly correlated with ROOMS | High correlation |
PRICE has 7610 (21.8%) missing values | Missing |
BEDROOM2 has 8217 (23.6%) missing values | Missing |
BATHROOM has 8226 (23.6%) missing values | Missing |
CAR has 8728 (25.0%) missing values | Missing |
LANDSIZE has 11810 (33.9%) missing values | Missing |
BUILDINGAREA has 21115 (60.6%) missing values | Missing |
YEARBUILT has 19306 (55.4%) missing values | Missing |
LATTITUDE has 7976 (22.9%) missing values | Missing |
LONGTITUDE has 7976 (22.9%) missing values | Missing |
LANDSIZE is highly skewed (γ1 = 96.02231136) | Skewed |
BUILDINGAREA is highly skewed (γ1 = 99.13257937) | Skewed |
ADDRESS is uniformly distributed | Uniform |
CAR has 1631 (4.7%) zeros | Zeros |
LANDSIZE has 2437 (7.0%) zeros | Zeros |
Reproduction
| Analysis started | 2022-12-15 05:18:48.210489 |
|---|---|
| Analysis finished | 2022-12-15 05:19:27.042202 |
| Duration | 38.83 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 351 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 272.4 KiB |
| Reservoir | 844 |
|---|---|
| Bentleigh East | 583 |
| Richmond | 552 |
| Glen Iris | 491 |
| Preston | 485 |
| Other values (346) |
Length
| Max length | 18 |
|---|---|
| Median length | 9 |
| Mean length | 9.819175488 |
| Min length | 3 |
Characters and Unicode
| Total characters | 342267 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Abbotsford |
|---|---|
| 2nd row | Abbotsford |
| 3rd row | Abbotsford |
| 4th row | Abbotsford |
| 5th row | Abbotsford |
| Value | Count | Frequency (%) |
| Reservoir | 844 | 2.4% |
| Bentleigh East | 583 | 1.7% |
| Richmond | 552 | 1.6% |
| Glen Iris | 491 | 1.4% |
| Preston | 485 | 1.4% |
| Kew | 467 | 1.3% |
| Brighton | 456 | 1.3% |
| Brunswick | 444 | 1.3% |
| South Yarra | 435 | 1.2% |
| Hawthorn | 428 | 1.2% |
| Other values (341) | 29672 |
| Value | Count | Frequency (%) |
| east | 2735 | 5.7% |
| north | 1795 | 3.7% |
| south | 1398 | 2.9% |
| west | 1084 | 2.3% |
| melbourne | 1053 | 2.2% |
| bentleigh | 902 | 1.9% |
| park | 885 | 1.8% |
| brunswick | 877 | 1.8% |
| brighton | 849 | 1.8% |
| reservoir | 844 | 1.8% |
| Other values (296) | 35685 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 30789 | 9.0% |
| r | 28896 | 8.4% |
| o | 28824 | 8.4% |
| n | 24963 | 7.3% |
| a | 23548 | 6.9% |
| t | 20717 | 6.1% |
| l | 19189 | 5.6% |
| i | 15833 | 4.6% |
| s | 15495 | 4.5% |
| 13250 | 3.9% | |
| Other values (39) | 120763 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 280783 | |
| Uppercase Letter | 48234 | 14.1% |
| Space Separator | 13250 | 3.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 30789 | |
| r | 28896 | |
| o | 28824 | |
| n | 24963 | |
| a | 23548 | 8.4% |
| t | 20717 | 7.4% |
| l | 19189 | 6.8% |
| i | 15833 | 5.6% |
| s | 15495 | 5.5% |
| h | 11319 | 4.0% |
| Other values (15) | 61210 |
| Value | Count | Frequency (%) |
| B | 5069 | 10.5% |
| M | 4335 | 9.0% |
| E | 4089 | 8.5% |
| S | 3679 | 7.6% |
| C | 3605 | 7.5% |
| H | 3479 | 7.2% |
| P | 3107 | 6.4% |
| N | 2805 | 5.8% |
| W | 2714 | 5.6% |
| A | 2309 | 4.8% |
| Other values (13) | 13043 |
| Value | Count | Frequency (%) |
| 13250 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 329017 | |
| Common | 13250 | 3.9% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 30789 | 9.4% |
| r | 28896 | 8.8% |
| o | 28824 | 8.8% |
| n | 24963 | 7.6% |
| a | 23548 | 7.2% |
| t | 20717 | 6.3% |
| l | 19189 | 5.8% |
| i | 15833 | 4.8% |
| s | 15495 | 4.7% |
| h | 11319 | 3.4% |
| Other values (38) | 109444 |
| Value | Count | Frequency (%) |
| 13250 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 342267 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 30789 | 9.0% |
| r | 28896 | 8.4% |
| o | 28824 | 8.4% |
| n | 24963 | 7.3% |
| a | 23548 | 6.9% |
| t | 20717 | 6.1% |
| l | 19189 | 5.6% |
| i | 15833 | 4.6% |
| s | 15495 | 4.5% |
| 13250 | 3.9% | |
| Other values (39) | 120763 |
| Distinct | 34009 |
|---|---|
| Distinct (%) | 97.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 272.4 KiB |
| 5 Charles St | 6 |
|---|---|
| 25 William St | 4 |
| 33 McCracken St | 3 |
| 16 Smith St | 3 |
| 57 Bay Rd | 3 |
| Other values (34004) |
Length
| Max length | 27 |
|---|---|
| Median length | 13 |
| Mean length | 13.55136701 |
| Min length | 8 |
Characters and Unicode
| Total characters | 472360 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 33201 ? |
|---|---|
| Unique (%) | 95.2% |
Sample
| 1st row | 68 Studley St |
|---|---|
| 2nd row | 85 Turner St |
| 3rd row | 25 Bloomburg St |
| 4th row | 18/659 Victoria St |
| 5th row | 5 Charles St |
| Value | Count | Frequency (%) |
| 5 Charles St | 6 | < 0.1% |
| 25 William St | 4 | < 0.1% |
| 33 McCracken St | 3 | < 0.1% |
| 16 Smith St | 3 | < 0.1% |
| 57 Bay Rd | 3 | < 0.1% |
| 7 Hope St | 3 | < 0.1% |
| 13 Robinson St | 3 | < 0.1% |
| 3 Charles St | 3 | < 0.1% |
| 12 Grandview Av | 3 | < 0.1% |
| 1088 Toorak Rd | 3 | < 0.1% |
| Other values (33999) | 34823 |
| Value | Count | Frequency (%) |
| st | 17238 | 16.4% |
| rd | 6592 | 6.3% |
| av | 3395 | 3.2% |
| ct | 1743 | 1.7% |
| dr | 1266 | 1.2% |
| cr | 1171 | 1.1% |
| gr | 733 | 0.7% |
| 3 | 695 | 0.7% |
| 5 | 671 | 0.6% |
| 4 | 656 | 0.6% |
| Other values (12873) | 70896 |
Most occurring characters
| Value | Count | Frequency (%) |
| 70199 | 14.9% | |
| t | 29720 | 6.3% |
| e | 24774 | 5.2% |
| r | 22874 | 4.8% |
| a | 22075 | 4.7% |
| S | 19703 | 4.2% |
| n | 18967 | 4.0% |
| 1 | 18081 | 3.8% |
| o | 17069 | 3.6% |
| l | 16609 | 3.5% |
| Other values (54) | 212289 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 238995 | |
| Decimal Number | 81436 | 17.2% |
| Uppercase Letter | 71535 | 15.1% |
| Space Separator | 70199 | 14.9% |
| Other Punctuation | 10195 | 2.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 19703 | |
| R | 8445 | |
| C | 6463 | 9.0% |
| A | 6076 | 8.5% |
| B | 3679 | 5.1% |
| M | 3246 | 4.5% |
| D | 2920 | 4.1% |
| P | 2782 | 3.9% |
| G | 2649 | 3.7% |
| W | 2547 | 3.6% |
| Other values (16) | 13025 |
| Value | Count | Frequency (%) |
| t | 29720 | |
| e | 24774 | |
| r | 22874 | |
| a | 22075 | |
| n | 18967 | 7.9% |
| o | 17069 | 7.1% |
| l | 16609 | 6.9% |
| d | 14673 | 6.1% |
| i | 13090 | 5.5% |
| s | 8861 | 3.7% |
| Other values (16) | 50283 |
| Value | Count | Frequency (%) |
| 1 | 18081 | |
| 2 | 12537 | |
| 3 | 9751 | |
| 4 | 7861 | |
| 5 | 6911 | 8.5% |
| 6 | 5934 | 7.3% |
| 7 | 5454 | 6.7% |
| 0 | 5328 | 6.5% |
| 8 | 5091 | 6.3% |
| 9 | 4488 | 5.5% |
| Value | Count | Frequency (%) |
| 70199 |
| Value | Count | Frequency (%) |
| / | 10195 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 310530 | |
| Common | 161830 |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 29720 | 9.6% |
| e | 24774 | 8.0% |
| r | 22874 | 7.4% |
| a | 22075 | 7.1% |
| S | 19703 | 6.3% |
| n | 18967 | 6.1% |
| o | 17069 | 5.5% |
| l | 16609 | 5.3% |
| d | 14673 | 4.7% |
| i | 13090 | 4.2% |
| Other values (42) | 110976 |
| Value | Count | Frequency (%) |
| 70199 | ||
| 1 | 18081 | 11.2% |
| 2 | 12537 | 7.7% |
| / | 10195 | 6.3% |
| 3 | 9751 | 6.0% |
| 4 | 7861 | 4.9% |
| 5 | 6911 | 4.3% |
| 6 | 5934 | 3.7% |
| 7 | 5454 | 3.4% |
| 0 | 5328 | 3.3% |
| Other values (2) | 9579 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 472360 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 70199 | 14.9% | |
| t | 29720 | 6.3% |
| e | 24774 | 5.2% |
| r | 22874 | 4.8% |
| a | 22075 | 4.7% |
| S | 19703 | 4.2% |
| n | 18967 | 4.0% |
| 1 | 18081 | 3.8% |
| o | 17069 | 3.6% |
| l | 16609 | 3.5% |
| Other values (54) | 212289 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.031012422 |
|---|---|
| Minimum | 1 |
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 16 |
| Range | 15 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.9699329349 |
|---|---|
| Coefficient of variation (CV) | 0.320002956 |
| Kurtosis | 2.511708654 |
| Mean | 3.031012422 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.4990968808 |
| Sum | 105652 |
| Variance | 0.9407698982 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 15084 | |
| 2 | 8332 | |
| 4 | 7956 | |
| 5 | 1737 | 5.0% |
| 1 | 1479 | 4.2% |
| 6 | 204 | 0.6% |
| 7 | 32 | 0.1% |
| 8 | 19 | 0.1% |
| 10 | 6 | < 0.1% |
| 9 | 4 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1479 | 4.2% |
| 2 | 8332 | |
| 3 | 15084 | |
| 4 | 7956 | |
| 5 | 1737 | 5.0% |
| Value | Count | Frequency (%) |
| 16 | 1 | < 0.1% |
| 12 | 3 | < 0.1% |
| 10 | 6 | < 0.1% |
| 9 | 4 | < 0.1% |
| 8 | 19 |
TYPE
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 272.4 KiB |
| h | |
|---|---|
| u | |
| t |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 34857 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | h |
|---|---|
| 2nd row | h |
| 3rd row | h |
| 4th row | u |
| 5th row | h |
| Value | Count | Frequency (%) |
| h | 23980 | |
| u | 7297 | 20.9% |
| t | 3580 | 10.3% |
| Value | Count | Frequency (%) |
| h | 23980 | |
| u | 7297 | 20.9% |
| t | 3580 | 10.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 23980 | |
| u | 7297 | 20.9% |
| t | 3580 | 10.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34857 |
Most frequent character per category
| Value | Count | Frequency (%) |
| h | 23980 | |
| u | 7297 | 20.9% |
| t | 3580 | 10.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34857 |
Most frequent character per script
| Value | Count | Frequency (%) |
| h | 23980 | |
| u | 7297 | 20.9% |
| t | 3580 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34857 |
Most frequent character per block
| Value | Count | Frequency (%) |
| h | 23980 | |
| u | 7297 | 20.9% |
| t | 3580 | 10.3% |
| Distinct | 2871 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 7610 |
| Missing (%) | 21.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1050173.345 |
|---|---|
| Minimum | 85000 |
| Maximum | 11200000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 85000 |
|---|---|
| 5-th percentile | 415000 |
| Q1 | 635000 |
| median | 870000 |
| Q3 | 1295000 |
| 95-th percentile | 2250000 |
| Maximum | 11200000 |
| Range | 11115000 |
| Interquartile range (IQR) | 660000 |
Descriptive statistics
| Standard deviation | 641467.1301 |
|---|---|
| Coefficient of variation (CV) | 0.6108202357 |
| Kurtosis | 13.09720052 |
| Mean | 1050173.345 |
| Median Absolute Deviation (MAD) | 290000 |
| Skewness | 2.588969341 |
| Sum | 2.861407313 × 1010 |
| Variance | 4.11480079 × 1011 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 600000 | 235 | 0.7% |
| 1100000 | 235 | 0.7% |
| 650000 | 219 | 0.6% |
| 800000 | 217 | 0.6% |
| 1300000 | 210 | 0.6% |
| 1000000 | 205 | 0.6% |
| 1200000 | 204 | 0.6% |
| 700000 | 197 | 0.6% |
| 750000 | 194 | 0.6% |
| 900000 | 191 | 0.5% |
| Other values (2861) | 25140 | |
| (Missing) | 7610 | 21.8% |
| Value | Count | Frequency (%) |
| 85000 | 1 | |
| 112000 | 1 | |
| 121000 | 1 | |
| 131000 | 1 | |
| 145000 | 2 |
| Value | Count | Frequency (%) |
| 11200000 | 1 | |
| 9000000 | 1 | |
| 8000000 | 1 | |
| 7650000 | 1 | |
| 7000000 | 1 |
METHOD
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 272.4 KiB |
| S | |
|---|---|
| SP | |
| PI | |
| VB | |
| SN | 1317 |
| Other values (4) | 743 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.428608314 |
| Min length | 1 |
Characters and Unicode
| Total characters | 49797 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SS |
|---|---|
| 2nd row | S |
| 3rd row | S |
| 4th row | VB |
| 5th row | SP |
| Value | Count | Frequency (%) |
| S | 19744 | |
| SP | 5095 | 14.6% |
| PI | 4850 | 13.9% |
| VB | 3108 | 8.9% |
| SN | 1317 | 3.8% |
| PN | 308 | 0.9% |
| SA | 226 | 0.6% |
| W | 173 | 0.5% |
| SS | 36 | 0.1% |
| Value | Count | Frequency (%) |
| s | 19744 | |
| sp | 5095 | 14.6% |
| pi | 4850 | 13.9% |
| vb | 3108 | 8.9% |
| sn | 1317 | 3.8% |
| pn | 308 | 0.9% |
| sa | 226 | 0.6% |
| w | 173 | 0.5% |
| ss | 36 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 26454 | |
| P | 10253 | 20.6% |
| I | 4850 | 9.7% |
| V | 3108 | 6.2% |
| B | 3108 | 6.2% |
| N | 1625 | 3.3% |
| A | 226 | 0.5% |
| W | 173 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 49797 |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 26454 | |
| P | 10253 | 20.6% |
| I | 4850 | 9.7% |
| V | 3108 | 6.2% |
| B | 3108 | 6.2% |
| N | 1625 | 3.3% |
| A | 226 | 0.5% |
| W | 173 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49797 |
Most frequent character per script
| Value | Count | Frequency (%) |
| S | 26454 | |
| P | 10253 | 20.6% |
| I | 4850 | 9.7% |
| V | 3108 | 6.2% |
| B | 3108 | 6.2% |
| N | 1625 | 3.3% |
| A | 226 | 0.5% |
| W | 173 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49797 |
Most frequent character per block
| Value | Count | Frequency (%) |
| S | 26454 | |
| P | 10253 | 20.6% |
| I | 4850 | 9.7% |
| V | 3108 | 6.2% |
| B | 3108 | 6.2% |
| N | 1625 | 3.3% |
| A | 226 | 0.5% |
| W | 173 | 0.3% |
| Distinct | 388 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 272.4 KiB |
| Jellis | |
|---|---|
| Nelson | |
| Barry | |
| hockingstuart | |
| Marshall | 2027 |
| Other values (383) |
Length
| Max length | 27 |
|---|---|
| Median length | 6 |
| Mean length | 6.291533982 |
| Min length | 1 |
Characters and Unicode
| Total characters | 219304 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 107 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Jellis |
|---|---|
| 2nd row | Biggin |
| 3rd row | Biggin |
| 4th row | Rounds |
| 5th row | Biggin |
| Value | Count | Frequency (%) |
| Jellis | 3359 | 9.6% |
| Nelson | 3236 | 9.3% |
| Barry | 3235 | 9.3% |
| hockingstuart | 2623 | 7.5% |
| Marshall | 2027 | 5.8% |
| Ray | 1950 | 5.6% |
| Buxton | 1868 | 5.4% |
| Biggin | 897 | 2.6% |
| Fletchers | 861 | 2.5% |
| Woodards | 714 | 2.0% |
| Other values (378) | 14087 |
| Value | Count | Frequency (%) |
| jellis | 3359 | 9.6% |
| nelson | 3236 | 9.3% |
| barry | 3235 | 9.3% |
| hockingstuart | 2623 | 7.5% |
| marshall | 2027 | 5.8% |
| ray | 1950 | 5.6% |
| buxton | 1868 | 5.4% |
| biggin | 897 | 2.6% |
| fletchers | 861 | 2.5% |
| woodards | 714 | 2.0% |
| Other values (373) | 14087 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 19452 | 8.9% |
| a | 19036 | 8.7% |
| r | 18356 | 8.4% |
| s | 16592 | 7.6% |
| e | 15693 | 7.2% |
| o | 13550 | 6.2% |
| n | 12384 | 5.6% |
| i | 11972 | 5.5% |
| t | 10132 | 4.6% |
| B | 7433 | 3.4% |
| Other values (48) | 74704 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 181580 | |
| Uppercase Letter | 36864 | 16.8% |
| Other Punctuation | 542 | 0.2% |
| Decimal Number | 318 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| B | 7433 | |
| J | 4047 | |
| N | 4009 | |
| R | 3806 | |
| M | 3715 | |
| G | 1631 | 4.4% |
| W | 1460 | 4.0% |
| H | 1368 | 3.7% |
| P | 1149 | 3.1% |
| F | 1068 | 2.9% |
| Other values (16) | 7178 |
| Value | Count | Frequency (%) |
| l | 19452 | |
| a | 19036 | |
| r | 18356 | |
| s | 16592 | |
| e | 15693 | |
| o | 13550 | 7.5% |
| n | 12384 | 6.8% |
| i | 11972 | 6.6% |
| t | 10132 | 5.6% |
| h | 7348 | 4.0% |
| Other values (15) | 37065 |
| Value | Count | Frequency (%) |
| ' | 318 | |
| . | 101 | 18.6% |
| & | 72 | 13.3% |
| / | 39 | 7.2% |
| @ | 12 | 2.2% |
| Value | Count | Frequency (%) |
| 2 | 159 | |
| 1 | 159 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 218444 | |
| Common | 860 | 0.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| l | 19452 | 8.9% |
| a | 19036 | 8.7% |
| r | 18356 | 8.4% |
| s | 16592 | 7.6% |
| e | 15693 | 7.2% |
| o | 13550 | 6.2% |
| n | 12384 | 5.7% |
| i | 11972 | 5.5% |
| t | 10132 | 4.6% |
| B | 7433 | 3.4% |
| Other values (41) | 73844 |
| Value | Count | Frequency (%) |
| ' | 318 | |
| 2 | 159 | |
| 1 | 159 | |
| . | 101 | 11.7% |
| & | 72 | 8.4% |
| / | 39 | 4.5% |
| @ | 12 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 219304 |
Most frequent character per block
| Value | Count | Frequency (%) |
| l | 19452 | 8.9% |
| a | 19036 | 8.7% |
| r | 18356 | 8.4% |
| s | 16592 | 7.6% |
| e | 15693 | 7.2% |
| o | 13550 | 6.2% |
| n | 12384 | 5.6% |
| i | 11972 | 5.5% |
| t | 10132 | 4.6% |
| B | 7433 | 3.4% |
| Other values (48) | 74704 |
| Distinct | 78 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 272.4 KiB |
| 28/10/2017 | 1119 |
|---|---|
| 17/03/2018 | 970 |
| 24/02/2018 | 941 |
| 9/12/2017 | 927 |
| 25/11/2017 | 902 |
| Other values (73) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.714748831 |
| Min length | 9 |
Characters and Unicode
| Total characters | 338627 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3/09/2016 |
|---|---|
| 2nd row | 3/12/2016 |
| 3rd row | 4/02/2016 |
| 4th row | 4/02/2016 |
| 5th row | 4/03/2017 |
| Value | Count | Frequency (%) |
| 28/10/2017 | 1119 | 3.2% |
| 17/03/2018 | 970 | 2.8% |
| 24/02/2018 | 941 | 2.7% |
| 9/12/2017 | 927 | 2.7% |
| 25/11/2017 | 902 | 2.6% |
| 18/11/2017 | 866 | 2.5% |
| 3/03/2018 | 846 | 2.4% |
| 6/01/2018 | 787 | 2.3% |
| 27/05/2017 | 770 | 2.2% |
| 23/09/2017 | 742 | 2.1% |
| Other values (68) | 25987 |
| Value | Count | Frequency (%) |
| 28/10/2017 | 1119 | 3.2% |
| 17/03/2018 | 970 | 2.8% |
| 24/02/2018 | 941 | 2.7% |
| 9/12/2017 | 927 | 2.7% |
| 25/11/2017 | 902 | 2.6% |
| 18/11/2017 | 866 | 2.5% |
| 3/03/2018 | 846 | 2.4% |
| 6/01/2018 | 787 | 2.3% |
| 27/05/2017 | 770 | 2.2% |
| 23/09/2017 | 742 | 2.1% |
| Other values (68) | 25987 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 69714 | |
| 0 | 65588 | |
| 1 | 64879 | |
| 2 | 53914 | |
| 7 | 28339 | |
| 6 | 16876 | 5.0% |
| 8 | 12653 | 3.7% |
| 3 | 7976 | 2.4% |
| 9 | 7436 | 2.2% |
| 5 | 5760 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 268913 | |
| Other Punctuation | 69714 | 20.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 65588 | |
| 1 | 64879 | |
| 2 | 53914 | |
| 7 | 28339 | |
| 6 | 16876 | 6.3% |
| 8 | 12653 | 4.7% |
| 3 | 7976 | 3.0% |
| 9 | 7436 | 2.8% |
| 5 | 5760 | 2.1% |
| 4 | 5492 | 2.0% |
| Value | Count | Frequency (%) |
| / | 69714 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 338627 |
Most frequent character per script
| Value | Count | Frequency (%) |
| / | 69714 | |
| 0 | 65588 | |
| 1 | 64879 | |
| 2 | 53914 | |
| 7 | 28339 | |
| 6 | 16876 | 5.0% |
| 8 | 12653 | 3.7% |
| 3 | 7976 | 2.4% |
| 9 | 7436 | 2.2% |
| 5 | 5760 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 338627 |
Most frequent character per block
| Value | Count | Frequency (%) |
| / | 69714 | |
| 0 | 65588 | |
| 1 | 64879 | |
| 2 | 53914 | |
| 7 | 28339 | |
| 6 | 16876 | 5.0% |
| 8 | 12653 | 3.7% |
| 3 | 7976 | 2.4% |
| 9 | 7436 | 2.2% |
| 5 | 5760 | 1.7% |
DISTANCE
Real number (ℝ≥0)
| Distinct | 215 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.18492942 |
|---|---|
| Minimum | 0 |
| Maximum | 48.1 |
| Zeros | 77 |
| Zeros (%) | 0.2% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.7 |
| Q1 | 6.4 |
| median | 10.3 |
| Q3 | 14 |
| 95-th percentile | 24.7 |
| Maximum | 48.1 |
| Range | 48.1 |
| Interquartile range (IQR) | 7.6 |
Descriptive statistics
| Standard deviation | 6.788892456 |
|---|---|
| Coefficient of variation (CV) | 0.6069678403 |
| Kurtosis | 3.585924276 |
| Mean | 11.18492942 |
| Median Absolute Deviation (MAD) | 3.85 |
| Skewness | 1.503585816 |
| Sum | 389861.9 |
| Variance | 46.08906078 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 11.2 | 1420 | 4.1% |
| 13.8 | 681 | 2.0% |
| 9.2 | 665 | 1.9% |
| 7.8 | 662 | 1.9% |
| 10.5 | 660 | 1.9% |
| 8.4 | 604 | 1.7% |
| 4.6 | 585 | 1.7% |
| 14.7 | 566 | 1.6% |
| 5.2 | 565 | 1.6% |
| 11.4 | 521 | 1.5% |
| Other values (205) | 27927 |
| Value | Count | Frequency (%) |
| 0 | 77 | |
| 0.7 | 29 | 0.1% |
| 1.2 | 47 | |
| 1.3 | 30 | 0.1% |
| 1.4 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 48.1 | 6 | < 0.1% |
| 47.4 | 7 | < 0.1% |
| 47.3 | 20 | |
| 45.9 | 33 | |
| 45.2 | 2 | < 0.1% |
POSTCODE
Real number (ℝ≥0)
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3116.062859 |
|---|---|
| Minimum | 3000 |
| Maximum | 3978 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 3000 |
|---|---|
| 5-th percentile | 3015 |
| Q1 | 3051 |
| median | 3103 |
| Q3 | 3156 |
| 95-th percentile | 3204 |
| Maximum | 3978 |
| Range | 978 |
| Interquartile range (IQR) | 105 |
Descriptive statistics
| Standard deviation | 109.0239027 |
|---|---|
| Coefficient of variation (CV) | 0.03498770971 |
| Kurtosis | 22.78373808 |
| Mean | 3116.062859 |
| Median Absolute Deviation (MAD) | 52 |
| Skewness | 4.018785705 |
| Sum | 108613487 |
| Variance | 11886.21137 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3073 | 844 | 2.4% |
| 3046 | 638 | 1.8% |
| 3020 | 617 | 1.8% |
| 3121 | 612 | 1.8% |
| 3165 | 583 | 1.7% |
| 3058 | 556 | 1.6% |
| 3040 | 535 | 1.5% |
| 3204 | 518 | 1.5% |
| 3163 | 508 | 1.5% |
| 3012 | 497 | 1.4% |
| Other values (201) | 28948 |
| Value | Count | Frequency (%) |
| 3000 | 204 | |
| 3002 | 59 | 0.2% |
| 3003 | 66 | 0.2% |
| 3006 | 76 | 0.2% |
| 3008 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 3978 | 5 | < 0.1% |
| 3977 | 33 | |
| 3976 | 7 | < 0.1% |
| 3975 | 2 | < 0.1% |
| 3910 | 18 |
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 8217 |
| Missing (%) | 23.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.084647147 |
|---|---|
| Minimum | 0 |
| Maximum | 30 |
| Zeros | 17 |
| Zeros (%) | < 0.1% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.9806897285 |
|---|---|
| Coefficient of variation (CV) | 0.3179260647 |
| Kurtosis | 26.80745531 |
| Mean | 3.084647147 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.406365679 |
| Sum | 82175 |
| Variance | 0.9617523437 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 11881 | |
| 4 | 6348 | |
| 2 | 5777 | |
| 5 | 1427 | 4.1% |
| 1 | 966 | 2.8% |
| 6 | 168 | 0.5% |
| 7 | 30 | 0.1% |
| 0 | 17 | < 0.1% |
| 8 | 13 | < 0.1% |
| 9 | 5 | < 0.1% |
| Other values (5) | 8 | < 0.1% |
| (Missing) | 8217 |
| Value | Count | Frequency (%) |
| 0 | 17 | < 0.1% |
| 1 | 966 | 2.8% |
| 2 | 5777 | |
| 3 | 11881 | |
| 4 | 6348 |
| Value | Count | Frequency (%) |
| 30 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 10 | 4 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8226 |
| Missing (%) | 23.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.624798168 |
|---|---|
| Minimum | 0 |
| Maximum | 12 |
| Zeros | 46 |
| Zeros (%) | 0.1% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7242120115 |
|---|---|
| Coefficient of variation (CV) | 0.4457242911 |
| Kurtosis | 4.861008943 |
| Mean | 1.624798168 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.356293032 |
| Sum | 43270 |
| Variance | 0.5244830376 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 12969 | |
| 2 | 11064 | |
| 3 | 2181 | 6.3% |
| 4 | 269 | 0.8% |
| 5 | 77 | 0.2% |
| 0 | 46 | 0.1% |
| 6 | 16 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| (Missing) | 8226 |
| Value | Count | Frequency (%) |
| 0 | 46 | 0.1% |
| 1 | 12969 | |
| 2 | 11064 | |
| 3 | 2181 | 6.3% |
| 4 | 269 | 0.8% |
| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 3 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 16 |
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 8728 |
| Missing (%) | 25.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.728845344 |
|---|---|
| Minimum | 0 |
| Maximum | 26 |
| Zeros | 1631 |
| Zeros (%) | 4.7% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.010770785 |
|---|---|
| Coefficient of variation (CV) | 0.5846507837 |
| Kurtosis | 20.85932625 |
| Mean | 1.728845344 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.09517618 |
| Sum | 45173 |
| Variance | 1.021657581 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 12214 | |
| 1 | 9164 | |
| 0 | 1631 | 4.7% |
| 3 | 1606 | 4.6% |
| 4 | 1161 | 3.3% |
| 5 | 151 | 0.4% |
| 6 | 140 | 0.4% |
| 7 | 25 | 0.1% |
| 8 | 23 | 0.1% |
| 10 | 6 | < 0.1% |
| Other values (5) | 8 | < 0.1% |
| (Missing) | 8728 |
| Value | Count | Frequency (%) |
| 0 | 1631 | 4.7% |
| 1 | 9164 | |
| 2 | 12214 | |
| 3 | 1606 | 4.6% |
| 4 | 1161 | 3.3% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| 10 | 6 |
| Distinct | 1684 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 11810 |
| Missing (%) | 33.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 593.5989934 |
|---|---|
| Minimum | 0 |
| Maximum | 433014 |
| Zeros | 2437 |
| Zeros (%) | 7.0% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 224 |
| median | 521 |
| Q3 | 670 |
| 95-th percentile | 1001 |
| Maximum | 433014 |
| Range | 433014 |
| Interquartile range (IQR) | 446 |
Descriptive statistics
| Standard deviation | 3398.841946 |
|---|---|
| Coefficient of variation (CV) | 5.725821614 |
| Kurtosis | 11580.16251 |
| Mean | 593.5989934 |
| Median Absolute Deviation (MAD) | 210 |
| Skewness | 96.02231136 |
| Sum | 13680676 |
| Variance | 11552126.58 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2437 | 7.0% |
| 650 | 204 | 0.6% |
| 697 | 123 | 0.4% |
| 585 | 97 | 0.3% |
| 700 | 86 | 0.2% |
| 604 | 84 | 0.2% |
| 534 | 81 | 0.2% |
| 696 | 80 | 0.2% |
| 652 | 68 | 0.2% |
| 600 | 68 | 0.2% |
| Other values (1674) | 19719 | |
| (Missing) | 11810 |
| Value | Count | Frequency (%) |
| 0 | 2437 | |
| 1 | 3 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 433014 | 1 | |
| 146699 | 1 | |
| 89030 | 1 | |
| 80000 | 1 | |
| 76000 | 1 |
| Distinct | 740 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 21115 |
| Missing (%) | 60.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 160.2564004 |
|---|---|
| Minimum | 0 |
| Maximum | 44515 |
| Zeros | 76 |
| Zeros (%) | 0.2% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 102 |
| median | 136 |
| Q3 | 188 |
| 95-th percentile | 310 |
| Maximum | 44515 |
| Range | 44515 |
| Interquartile range (IQR) | 86 |
Descriptive statistics
| Standard deviation | 401.2670601 |
|---|---|
| Coefficient of variation (CV) | 2.50390661 |
| Kurtosis | 10877.52575 |
| Mean | 160.2564004 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 99.13257937 |
| Sum | 2202243.454 |
| Variance | 161015.2535 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 120 | 185 | 0.5% |
| 100 | 161 | 0.5% |
| 110 | 159 | 0.5% |
| 130 | 153 | 0.4% |
| 115 | 149 | 0.4% |
| 140 | 142 | 0.4% |
| 150 | 136 | 0.4% |
| 112 | 123 | 0.4% |
| 160 | 123 | 0.4% |
| 125 | 119 | 0.3% |
| Other values (730) | 12292 | |
| (Missing) | 21115 |
| Value | Count | Frequency (%) |
| 0 | 76 | |
| 0.01 | 1 | < 0.1% |
| 1 | 15 | < 0.1% |
| 2 | 20 | 0.1% |
| 3 | 25 | 0.1% |
| Value | Count | Frequency (%) |
| 44515 | 1 | |
| 6791 | 1 | |
| 6178 | 1 | |
| 4645 | 1 | |
| 3647 | 1 |
| Distinct | 160 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 19306 |
| Missing (%) | 55.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1965.289885 |
|---|---|
| Minimum | 1196 |
| Maximum | 2106 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 1196 |
|---|---|
| 5-th percentile | 1900 |
| Q1 | 1940 |
| median | 1970 |
| Q3 | 2000 |
| 95-th percentile | 2013 |
| Maximum | 2106 |
| Range | 910 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 37.32817802 |
|---|---|
| Coefficient of variation (CV) | 0.01899372622 |
| Kurtosis | 10.89861685 |
| Mean | 1965.289885 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | -1.080913147 |
| Sum | 30562223 |
| Variance | 1393.392875 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1970 | 1490 | 4.3% |
| 1960 | 1260 | 3.6% |
| 1950 | 1089 | 3.1% |
| 1980 | 726 | 2.1% |
| 1900 | 606 | 1.7% |
| 2000 | 571 | 1.6% |
| 1920 | 545 | 1.6% |
| 1930 | 531 | 1.5% |
| 1910 | 460 | 1.3% |
| 1890 | 444 | 1.3% |
| Other values (150) | 7829 | |
| (Missing) | 19306 |
| Value | Count | Frequency (%) |
| 1196 | 1 | < 0.1% |
| 1800 | 1 | < 0.1% |
| 1820 | 1 | < 0.1% |
| 1830 | 1 | < 0.1% |
| 1850 | 4 |
| Value | Count | Frequency (%) |
| 2106 | 1 | < 0.1% |
| 2019 | 1 | < 0.1% |
| 2018 | 4 | < 0.1% |
| 2017 | 82 | |
| 2016 | 130 |
COUNCILAREA
Categorical
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 272.4 KiB |
| Boroondara City Council | |
|---|---|
| Darebin City Council | |
| Moreland City Council | 2122 |
| Glen Eira City Council | 2006 |
| Melbourne City Council | 1952 |
| Other values (28) |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 21.73417685 |
| Min length | 17 |
Characters and Unicode
| Total characters | 757523 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yarra City Council |
|---|---|
| 2nd row | Yarra City Council |
| 3rd row | Yarra City Council |
| 4th row | Yarra City Council |
| 5th row | Yarra City Council |
| Value | Count | Frequency (%) |
| Boroondara City Council | 3675 | 10.5% |
| Darebin City Council | 2851 | 8.2% |
| Moreland City Council | 2122 | 6.1% |
| Glen Eira City Council | 2006 | 5.8% |
| Melbourne City Council | 1952 | 5.6% |
| Banyule City Council | 1861 | 5.3% |
| Moonee Valley City Council | 1791 | 5.1% |
| Bayside City Council | 1764 | 5.1% |
| Brimbank City Council | 1593 | 4.6% |
| Monash City Council | 1466 | 4.2% |
| Other values (23) | 13773 |
| Value | Count | Frequency (%) |
| council | 34854 | |
| city | 34550 | |
| boroondara | 3675 | 3.3% |
| darebin | 2851 | 2.6% |
| moreland | 2122 | 1.9% |
| glen | 2006 | 1.8% |
| eira | 2006 | 1.8% |
| melbourne | 1952 | 1.8% |
| banyule | 1861 | 1.7% |
| valley | 1791 | 1.6% |
| Other values (31) | 23375 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 87034 | |
| 76189 | ||
| n | 72285 | |
| C | 69621 | 9.2% |
| o | 66378 | 8.8% |
| l | 50280 | 6.6% |
| y | 43159 | 5.7% |
| t | 42811 | 5.7% |
| u | 39969 | 5.3% |
| c | 34920 | 4.6% |
| Other values (27) | 174877 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 570291 | |
| Uppercase Letter | 111043 | 14.7% |
| Space Separator | 76189 | 10.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 87034 | |
| n | 72285 | |
| o | 66378 | |
| l | 50280 | |
| y | 43159 | |
| t | 42811 | |
| u | 39969 | |
| c | 34920 | |
| a | 33706 | 5.9% |
| r | 27026 | 4.7% |
| Other values (10) | 72723 |
| Value | Count | Frequency (%) |
| C | 69621 | |
| M | 10699 | 9.6% |
| B | 9835 | 8.9% |
| D | 3165 | 2.9% |
| P | 2560 | 2.3% |
| G | 2320 | 2.1% |
| H | 2156 | 1.9% |
| W | 2070 | 1.9% |
| E | 2006 | 1.8% |
| V | 1791 | 1.6% |
| Other values (6) | 4820 | 4.3% |
| Value | Count | Frequency (%) |
| 76189 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 681334 | |
| Common | 76189 | 10.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 87034 | |
| n | 72285 | |
| C | 69621 | |
| o | 66378 | |
| l | 50280 | 7.4% |
| y | 43159 | 6.3% |
| t | 42811 | 6.3% |
| u | 39969 | 5.9% |
| c | 34920 | 5.1% |
| a | 33706 | 4.9% |
| Other values (26) | 141171 |
| Value | Count | Frequency (%) |
| 76189 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 757523 |
Most frequent character per block
| Value | Count | Frequency (%) |
| i | 87034 | |
| 76189 | ||
| n | 72285 | |
| C | 69621 | 9.2% |
| o | 66378 | 8.8% |
| l | 50280 | 6.6% |
| y | 43159 | 5.7% |
| t | 42811 | 5.7% |
| u | 39969 | 5.3% |
| c | 34920 | 4.6% |
| Other values (27) | 174877 |
| Distinct | 13402 |
|---|---|
| Distinct (%) | 49.9% |
| Missing | 7976 |
| Missing (%) | 22.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -37.8106343 |
|---|---|
| Minimum | -38.19043 |
| Maximum | -37.3902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | -38.19043 |
|---|---|
| 5-th percentile | -37.9485 |
| Q1 | -37.86295 |
| median | -37.8076 |
| Q3 | -37.7541 |
| 95-th percentile | -37.67519 |
| Maximum | -37.3902 |
| Range | 0.80023 |
| Interquartile range (IQR) | 0.10885 |
Descriptive statistics
| Standard deviation | 0.09027890451 |
|---|---|
| Coefficient of variation (CV) | -0.002387659086 |
| Kurtosis | 1.544527049 |
| Mean | -37.8106343 |
| Median Absolute Deviation (MAD) | 0.05448 |
| Skewness | -0.2576614223 |
| Sum | -1016387.66 |
| Variance | 0.008150280599 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| -37.8361 | 25 | 0.1% |
| -37.8424 | 22 | 0.1% |
| -37.8198 | 20 | 0.1% |
| -37.7956 | 20 | 0.1% |
| -37.8414 | 18 | 0.1% |
| -37.7969 | 18 | 0.1% |
| -37.7941 | 17 | < 0.1% |
| -37.8536 | 17 | < 0.1% |
| -37.7634 | 17 | < 0.1% |
| -37.7818 | 16 | < 0.1% |
| Other values (13392) | 26691 | |
| (Missing) | 7976 | 22.9% |
| Value | Count | Frequency (%) |
| -38.19043 | 1 | |
| -38.1856 | 1 | |
| -38.18463 | 1 | |
| -38.18418 | 1 | |
| -38.18415 | 1 |
| Value | Count | Frequency (%) |
| -37.3902 | 1 | |
| -37.3951 | 1 | |
| -37.3978 | 1 | |
| -37.39946 | 1 | |
| -37.40349 | 1 |
| Distinct | 14524 |
|---|---|
| Distinct (%) | 54.0% |
| Missing | 7976 |
| Missing (%) | 22.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 145.0018511 |
|---|---|
| Minimum | 144.42379 |
| Maximum | 145.52635 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 144.42379 |
|---|---|
| 5-th percentile | 144.80008 |
| Q1 | 144.9335 |
| median | 145.0078 |
| Q3 | 145.0719 |
| 95-th percentile | 145.1877 |
| Maximum | 145.52635 |
| Range | 1.10256 |
| Interquartile range (IQR) | 0.1384 |
Descriptive statistics
| Standard deviation | 0.1201687692 |
|---|---|
| Coefficient of variation (CV) | 0.0008287395521 |
| Kurtosis | 1.545947474 |
| Mean | 145.0018511 |
| Median Absolute Deviation (MAD) | 0.06832 |
| Skewness | -0.3948800169 |
| Sum | 3897794.76 |
| Variance | 0.01444053308 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 144.9966 | 21 | 0.1% |
| 144.985 | 17 | < 0.1% |
| 145.0104 | 17 | < 0.1% |
| 144.991 | 17 | < 0.1% |
| 144.9911 | 16 | < 0.1% |
| 145.0001 | 16 | < 0.1% |
| 145.0243 | 16 | < 0.1% |
| 144.9679 | 16 | < 0.1% |
| 144.9974 | 15 | < 0.1% |
| 144.9999 | 15 | < 0.1% |
| Other values (14514) | 26715 | |
| (Missing) | 7976 | 22.9% |
| Value | Count | Frequency (%) |
| 144.42379 | 1 | |
| 144.43162 | 1 | |
| 144.43181 | 1 | |
| 144.4394 | 1 | |
| 144.44051 | 1 |
| Value | Count | Frequency (%) |
| 145.52635 | 1 | |
| 145.5237 | 1 | |
| 145.51137 | 1 | |
| 145.48985 | 1 | |
| 145.48273 | 1 |
REGIONNAME
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 272.4 KiB |
| Southern Metropolitan | |
|---|---|
| Northern Metropolitan | |
| Western Metropolitan | |
| Eastern Metropolitan | |
| South-Eastern Metropolitan | |
| Other values (3) | 546 |
Length
| Max length | 26 |
|---|---|
| Median length | 21 |
| Mean length | 20.85631491 |
| Min length | 16 |
Characters and Unicode
| Total characters | 726926 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Northern Metropolitan |
|---|---|
| 2nd row | Northern Metropolitan |
| 3rd row | Northern Metropolitan |
| 4th row | Northern Metropolitan |
| 5th row | Northern Metropolitan |
| Value | Count | Frequency (%) |
| Southern Metropolitan | 11836 | |
| Northern Metropolitan | 9557 | |
| Western Metropolitan | 6799 | |
| Eastern Metropolitan | 4377 | 12.6% |
| South-Eastern Metropolitan | 1739 | 5.0% |
| Eastern Victoria | 228 | 0.7% |
| Northern Victoria | 203 | 0.6% |
| Western Victoria | 115 | 0.3% |
| (Missing) | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| metropolitan | 34308 | |
| southern | 11836 | 17.0% |
| northern | 9760 | 14.0% |
| western | 6914 | 9.9% |
| eastern | 4605 | 6.6% |
| south-eastern | 1739 | 2.5% |
| victoria | 546 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 105755 | |
| o | 92497 | |
| r | 79468 | |
| e | 76076 | |
| n | 69162 | |
| a | 41198 | 5.7% |
| i | 35400 | 4.9% |
| 34854 | 4.8% | |
| M | 34308 | 4.7% |
| p | 34308 | 4.7% |
| Other values (11) | 123900 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 618886 | |
| Uppercase Letter | 71447 | 9.8% |
| Space Separator | 34854 | 4.8% |
| Dash Punctuation | 1739 | 0.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| t | 105755 | |
| o | 92497 | |
| r | 79468 | |
| e | 76076 | |
| n | 69162 | |
| a | 41198 | 6.7% |
| i | 35400 | 5.7% |
| p | 34308 | 5.5% |
| l | 34308 | 5.5% |
| h | 23335 | 3.8% |
| Other values (3) | 27379 | 4.4% |
| Value | Count | Frequency (%) |
| M | 34308 | |
| S | 13575 | 19.0% |
| N | 9760 | 13.7% |
| W | 6914 | 9.7% |
| E | 6344 | 8.9% |
| V | 546 | 0.8% |
| Value | Count | Frequency (%) |
| 34854 |
| Value | Count | Frequency (%) |
| - | 1739 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 690333 | |
| Common | 36593 | 5.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 105755 | |
| o | 92497 | |
| r | 79468 | |
| e | 76076 | |
| n | 69162 | |
| a | 41198 | 6.0% |
| i | 35400 | 5.1% |
| M | 34308 | 5.0% |
| p | 34308 | 5.0% |
| l | 34308 | 5.0% |
| Other values (9) | 87853 |
| Value | Count | Frequency (%) |
| 34854 | ||
| - | 1739 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 726926 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 105755 | |
| o | 92497 | |
| r | 79468 | |
| e | 76076 | |
| n | 69162 | |
| a | 41198 | 5.7% |
| i | 35400 | 4.9% |
| 34854 | 4.8% | |
| M | 34308 | 4.7% |
| p | 34308 | 4.7% |
| Other values (11) | 123900 |
PROPERTYCOUNT
Real number (ℝ≥0)
| Distinct | 342 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7572.888306 |
|---|---|
| Minimum | 83 |
| Maximum | 21650 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 272.4 KiB |
Quantile statistics
| Minimum | 83 |
|---|---|
| 5-th percentile | 2185 |
| Q1 | 4385 |
| median | 6763 |
| Q3 | 10412 |
| 95-th percentile | 15510 |
| Maximum | 21650 |
| Range | 21567 |
| Interquartile range (IQR) | 6027 |
Descriptive statistics
| Standard deviation | 4428.090313 |
|---|---|
| Coefficient of variation (CV) | 0.5847293839 |
| Kurtosis | 0.8906876388 |
| Mean | 7572.888306 |
| Median Absolute Deviation (MAD) | 2823 |
| Skewness | 0.9921002749 |
| Sum | 263945449 |
| Variance | 19607983.82 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 21650 | 844 | 2.4% |
| 8870 | 722 | 2.1% |
| 10969 | 583 | 1.7% |
| 14949 | 552 | 1.6% |
| 10412 | 491 | 1.4% |
| 14577 | 485 | 1.4% |
| 10331 | 467 | 1.3% |
| 10579 | 456 | 1.3% |
| 11918 | 444 | 1.3% |
| 14887 | 435 | 1.2% |
| Other values (332) | 29375 |
| Value | Count | Frequency (%) |
| 83 | 1 | < 0.1% |
| 121 | 1 | < 0.1% |
| 129 | 1 | < 0.1% |
| 242 | 1 | < 0.1% |
| 249 | 5 |
| Value | Count | Frequency (%) |
| 21650 | 844 | |
| 17496 | 204 | 0.6% |
| 17384 | 20 | 0.1% |
| 17093 | 47 | 0.1% |
| 17055 | 123 | 0.4% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| SUBURB | ADDRESS | ROOMS | TYPE | PRICE | METHOD | SELLERG | DATE | DISTANCE | POSTCODE | BEDROOM2 | BATHROOM | CAR | LANDSIZE | BUILDINGAREA | YEARBUILT | COUNCILAREA | LATTITUDE | LONGTITUDE | REGIONNAME | PROPERTYCOUNT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Abbotsford | 68 Studley St | 2 | h | NaN | SS | Jellis | 3/09/2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 1.0 | 126.0 | NaN | NaN | Yarra City Council | -37.8014 | 144.9958 | Northern Metropolitan | 4019.0 |
| 1 | Abbotsford | 85 Turner St | 2 | h | 1480000.0 | S | Biggin | 3/12/2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 1.0 | 202.0 | NaN | NaN | Yarra City Council | -37.7996 | 144.9984 | Northern Metropolitan | 4019.0 |
| 2 | Abbotsford | 25 Bloomburg St | 2 | h | 1035000.0 | S | Biggin | 4/02/2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 0.0 | 156.0 | 79.0 | 1900.0 | Yarra City Council | -37.8079 | 144.9934 | Northern Metropolitan | 4019.0 |
| 3 | Abbotsford | 18/659 Victoria St | 3 | u | NaN | VB | Rounds | 4/02/2016 | 2.5 | 3067.0 | 3.0 | 2.0 | 1.0 | 0.0 | NaN | NaN | Yarra City Council | -37.8114 | 145.0116 | Northern Metropolitan | 4019.0 |
| 4 | Abbotsford | 5 Charles St | 3 | h | 1465000.0 | SP | Biggin | 4/03/2017 | 2.5 | 3067.0 | 3.0 | 2.0 | 0.0 | 134.0 | 150.0 | 1900.0 | Yarra City Council | -37.8093 | 144.9944 | Northern Metropolitan | 4019.0 |
| 5 | Abbotsford | 40 Federation La | 3 | h | 850000.0 | PI | Biggin | 4/03/2017 | 2.5 | 3067.0 | 3.0 | 2.0 | 1.0 | 94.0 | NaN | NaN | Yarra City Council | -37.7969 | 144.9969 | Northern Metropolitan | 4019.0 |
| 6 | Abbotsford | 55a Park St | 4 | h | 1600000.0 | VB | Nelson | 4/06/2016 | 2.5 | 3067.0 | 3.0 | 1.0 | 2.0 | 120.0 | 142.0 | 2014.0 | Yarra City Council | -37.8072 | 144.9941 | Northern Metropolitan | 4019.0 |
| 7 | Abbotsford | 16 Maugie St | 4 | h | NaN | SN | Nelson | 6/08/2016 | 2.5 | 3067.0 | 3.0 | 2.0 | 2.0 | 400.0 | 220.0 | 2006.0 | Yarra City Council | -37.7965 | 144.9965 | Northern Metropolitan | 4019.0 |
| 8 | Abbotsford | 53 Turner St | 2 | h | NaN | S | Biggin | 6/08/2016 | 2.5 | 3067.0 | 4.0 | 1.0 | 2.0 | 201.0 | NaN | 1900.0 | Yarra City Council | -37.7995 | 144.9974 | Northern Metropolitan | 4019.0 |
| 9 | Abbotsford | 99 Turner St | 2 | h | NaN | S | Collins | 6/08/2016 | 2.5 | 3067.0 | 3.0 | 2.0 | 1.0 | 202.0 | NaN | 1900.0 | Yarra City Council | -37.7996 | 144.9989 | Northern Metropolitan | 4019.0 |
Last rows
| SUBURB | ADDRESS | ROOMS | TYPE | PRICE | METHOD | SELLERG | DATE | DISTANCE | POSTCODE | BEDROOM2 | BATHROOM | CAR | LANDSIZE | BUILDINGAREA | YEARBUILT | COUNCILAREA | LATTITUDE | LONGTITUDE | REGIONNAME | PROPERTYCOUNT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 34847 | Wollert | 27 Birchmore Rd | 3 | h | 500000.0 | PI | Ray | 24/02/2018 | 25.5 | 3750.0 | 3.0 | 2.0 | 2.0 | 383.0 | 118.0 | 2016.0 | Whittlesea City Council | -37.61940 | 145.03951 | Northern Metropolitan | 2940.0 |
| 34848 | Wollert | 16 Gunther Wy | 4 | h | 621000.0 | S | hockingstuart | 24/02/2018 | 25.5 | 3750.0 | 4.0 | 2.0 | 2.0 | 375.0 | NaN | NaN | Whittlesea City Council | -37.61331 | 145.03412 | Northern Metropolitan | 2940.0 |
| 34849 | Wollert | 35 Kingscote Wy | 3 | h | 570000.0 | SP | RW | 24/02/2018 | 25.5 | 3750.0 | 3.0 | 2.0 | 2.0 | 404.0 | 158.0 | 2012.0 | Whittlesea City Council | -37.61031 | 145.03393 | Northern Metropolitan | 2940.0 |
| 34850 | Wollert | 15 Rockgarden Wy | 3 | h | NaN | SP | LJ | 24/02/2018 | 25.5 | 3750.0 | 3.0 | 2.0 | 2.0 | 268.0 | 135.0 | 2016.0 | Whittlesea City Council | -37.61094 | 145.04281 | Northern Metropolitan | 2940.0 |
| 34851 | Yarraville | 78 Bayview Rd | 3 | h | 1101000.0 | S | Jas | 24/02/2018 | 6.3 | 3013.0 | 3.0 | 1.0 | NaN | 288.0 | NaN | NaN | Maribyrnong City Council | -37.81095 | 144.88516 | Western Metropolitan | 6543.0 |
| 34852 | Yarraville | 13 Burns St | 4 | h | 1480000.0 | PI | Jas | 24/02/2018 | 6.3 | 3013.0 | 4.0 | 1.0 | 3.0 | 593.0 | NaN | NaN | Maribyrnong City Council | -37.81053 | 144.88467 | Western Metropolitan | 6543.0 |
| 34853 | Yarraville | 29A Murray St | 2 | h | 888000.0 | SP | Sweeney | 24/02/2018 | 6.3 | 3013.0 | 2.0 | 2.0 | 1.0 | 98.0 | 104.0 | 2018.0 | Maribyrnong City Council | -37.81551 | 144.88826 | Western Metropolitan | 6543.0 |
| 34854 | Yarraville | 147A Severn St | 2 | t | 705000.0 | S | Jas | 24/02/2018 | 6.3 | 3013.0 | 2.0 | 1.0 | 2.0 | 220.0 | 120.0 | 2000.0 | Maribyrnong City Council | -37.82286 | 144.87856 | Western Metropolitan | 6543.0 |
| 34855 | Yarraville | 12/37 Stephen St | 3 | h | 1140000.0 | SP | hockingstuart | 24/02/2018 | 6.3 | 3013.0 | NaN | NaN | NaN | NaN | NaN | NaN | Maribyrnong City Council | NaN | NaN | Western Metropolitan | 6543.0 |
| 34856 | Yarraville | 3 Tarrengower St | 2 | h | 1020000.0 | PI | RW | 24/02/2018 | 6.3 | 3013.0 | 2.0 | 1.0 | 0.0 | 250.0 | 103.0 | 1930.0 | Maribyrnong City Council | -37.81810 | 144.89351 | Western Metropolitan | 6543.0 |